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ABSTRACT 


Corethrogyne DC. (Asteraceae) is a controversial genus due to complex and confounding variability, and 
the subject of multiple taxonomic revisions. It was previously included within Lessingia Cham. The single 
species, C. filaginifolia Nutt., has high infraspecific variability. This study was conducted to better understand 
this infraspecific variability for C. filaginifolia var. incana (Lindl.) Canby and C. filaginifolia var. linifolia H. 
M. Hall. Barcoding loci from nuclear ribosomal DNA (nrDNA) and chloroplast DNA (cpDNA) were used 
along with morphological comparisons of voucher specimens. Based on two nrDNA loci, maximum 
parsimony, maximum likelihood, and Bayesian analyses, we detected five haplotypes of C. filaginifolia. The 
two focal varieties of this study, C. filaginifolia var. incana and C. filaginifolia var. linifolia, were 


indistinguishable at both, nrDNA and cpDNA loci. 
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Asteraceae is one of the largest families of 
Angiosperms with more than 23,000 species (Panero 
and Funk 2008). Rapid diversification approximately 
30 million years ago resulted in high morphological, 
chemical, and biological diversity (Jensen and Palmer 
1987). Phylogenetic relationships at the tribal and 
subfamily levels within the Asteraceae have been 
examined extensively (Funk et al. 2009), though 
confusion still persists for some major lineages 
(Panero and Funk 2008). Corethrogyne DC., com- 
monly known as sandaster, is one of several 
controversial genera in the Asteraceae. Its distribu- 
tion is restricted to parts of the western United States 
(California and southwestern Oregon) and Mexico 
(northern Baja California) (Markos and Strother 
2006; Baldwin et al. 2012) (Fig. la). 

The taxonomy of Corethrogyne has been revised 
frequently. In 1959, Keck first proposed three species 
in the genus. By 1960, however, Ferris suggested 
seven species. Munz followed in 1974 to clarify the 
taxonomy of C. filaginifolia Nutt. using only plant 
material from southern California, and presented 11 
varieties. Approximately two decades later, Lane 
(1992, 1993) unified the genus Corethrogyne with 
Lessingia Cham. based on morphological and 
chloroplast DNA similarities. Between 1993 and late 
1990s, various studies (Zhang and Bremer 1993; 
Nesom 1994; Morgan 1997) showed that Lessingia is 
closely related to Corethrogyne based on morphology 
and chloroplast DNA. More recently, however, 
Saroyan et al. (2000) published a revision presenting 
Corethrogyne as distinct from Lessingia based on the 
perennial life cycle and radiate inflorescence heads in 
Corethrogyne as opposed to an annual life cycle and 
sometimes discoid inflorescence in Lessingia. Nota- 
bly, the same authors also reported morphometric 
distinctiveness between northern and southern pop- 
ulations of C. filaginifolia in California based on the 


number of inflorescences per flowering stem and 
length:diameter ratio of the involucres. Subsequent- 
ly, in 2001, Markos and Baldwin provided genetic 
evidence to support the discrimination between 
Lessingia and Corethrogyne. In 2012, Markos and 
Strother recognized a single species of Corethrogyne, 
C. filaginifolia Nutt., without any infraspecific taxa. 
Simultaneously, California Native Plant Society 
(CNPS 2015) maintains two species within the genus 
Corethrogyne (C. leucophylla Jeps. and C. filaginifo- 
lia) and recognizes two varieties of C. filaginifolia: 1) 
C. filaginifolia Nutt. var. incana (Lindl.) Canby, and 
2) C. filaginifolia Nutt. var. linifolia H.M. Hall. 
Adding to the confusion further, CalFlora (2015) 
also recognizes the same two species (C. leucophylla 
and C. filaginifolia), but maintains four varieties 
within C. filaginifolia: 1) C. filaginifolia Nutt. var. 
californica (DC.) Saroyan, 2) C. filaginifolia Nutt. 
var. filaginifolia, 3) C. filaginifolia var. incana, and 4) 
C. filaginifolia var. linifolia. The three Corethrogyne 
taxa recognized by CNPS are also listed in the rare, 
and endangered plant inventory of CNPS (2015), but 
considering the taxonomic uncertainty within the 
genus and within the C. filaginifolia complex in 
particular, the conservation status of these taxa 
remains unclear (Markos and Baldwin 2001). 

Two of the recognized taxa, C. filaginifolia var. 
incana and C. filaginifolia var. linifolia, are given a 
rarity rank of 1.B1 (CNPS 2015). Both taxa are 
endemic to San Diego County and occur in coastal 
scrub and chaparral, but occupy different areas 
within this narrow geographic range (Munz 1974; 
CNPS 2015) (Fig. 1b). Corethrogyne filaginifolia var. 
incana is restricted to sandy soils of the Point Loma 
peninsula. Corethrogyne filaginifolia var. linifolia 
occurs on open bluffs and in shrubby habitats of 
Del Mar. 
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Maps showing the distribution of Corethrogyne filaginifolia and locations where individuals were sampled for this 


study. a) A map of California showing the counties hosting C. filaginifolia populations, and coarse distribution of C. 
filaginifolia within them indicated by hatching (CalFlora 2015). b) A map of San Diego County showing the approximate 
distribution of C. filaginifolia var. incana and C. filaginifolia var. linifolia (also from CalFlora 2015). Additionally, a 
sampling location within Torrey Pine State Reserve (TPSR) is shown where C. filaginifolia var. linifolia was sampled. c) A - 
map of Point Loma peninsula showing locations where the two varieties of C. filaginifolia were sampled; Cabrillo Drive 
(CabN, CabS), McClelland Road (McC), SPAWAR (Spa), Electron Drive (Elec), and National Park Service (NPS) 
represent the sampling sites for C. filaginifolia var. incana, while the location coded as Point Loma (PL) represent the 


sampling site for C. filaginifolia var. linifolia. 


However, both taxa have been reported (as 
collections and occurrences) from outside these core 
ranges (CalFlora 2015). CNPS (2015) reported that the 
distribution of C. filaginifolia var. linifolia extended to 
Rancho Santa Fe, La Jolla, Del Mar, Encinitas and 
San Luis Rey, and the distribution of C. filaginifolia 
var. incana included National City, Point Loma, 
Imperial Beach, Del Mar, La Jolla, and Rancho Santa 
Fe (Fig. 1b). Because both taxa (C. filaginifolia var. 


incana and C. filaginifolia var. linifolia) are present on 
the Point Loma peninsula where they may potentially 
hybridize, conservation concerns arose with respect to 
their taxonomic uniqueness. 

Corethrogyne filaginifolia var. incana can be distin- 
guished morphologically from C. filaginifolia var. 
linifolia by presence of distinct stout-stalked glands 
(Fig. 2a) on the involucral bracts and distal plant parts 
making them sticky to touch. In comparison, C. 
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FIG. 2. Morphological differences between involucral bracts of Corethrogyne filaginifolia var. incana and C. filaginifolia 
var. linifolia are shown. a) Involucral bracts of C. filaginifolia var. incana are covered with stout-stalked glands. b) Involucral 


bracts of C. filaginifolia var. linifolia have tomentose surfaces. 


filaginifolia var. linifolia has tomentose involucral 
bracts, lacking glands (Munz 1974) (Fig. 2b). Munz 
(1974) reported additional morphological and pheno- 
logical differences to discriminate between the two taxa 
which included; 1) plant height (C. filaginifolia var. 
incana = 50-80 cm, C. filaginifolia var. linifolia = 20-40 
cm), 2) involucre length (C. filaginifolia var. incana = 
10-12 mm, C. filaginifolia var. linifolia = 8-10 mm), 
and 3) leaf shape (C. filaginifolia var. incana = linear to 
narrowly lanceolate, C. filaginifolia var. linifolia = 
narrow with 1-2 mm width). Further, genetic differ- 
ences between the two varieties have not been 
investigated to test the taxonomic distinctiveness. 

We sought to assess whether the two varieties, 
Corethrogyne filaginifolia var. incana and C. filagini- 
folia var. linifolia, could be distinguished using 
multiple DNA barcoding loci (Hebert et al. 2003). 
Toward this end, we compared the two taxa using six 
genomic loci that have previously been used to 
discriminate taxa within the Asteraceae and in other 
plant families (Chase et al. 2007; Kress et al. 2007; 
CBOL Plant Working Group 2009; Chen et al. 2010; 
Gaoret'al: 2010): 

Further, to broadly compare with the Californian 
C. filaginifolia complex, we combined our data 


generated in this study (without vouchered speci- 
mens) from the two varieties with sequence data 
(with vouchered specimens) previously generated by 
Markos and Baldwin (2001, 2002) and from NCBI 
(PopSet 417346869). We hypothesized that: 1) 
Corethrogyne filaginifolia var. incana and C. filagini- 


folia var. linifolia are genetically distinguishable from 


each other at the selected barcode loci. We based this 
hypothesis on the observations of Munz (1974) and 
our personal observation of the morphological 
differences between the two varieties; and 2) individ- 
uals within the Californian C. filaginifolia complex 
show sub-specific variation based on a latitudinal 
gradient. This hypothesis was based on the morpho- 
metric study of Saroyandet al. (2000), which 
suggested the distinction between northern and 
southern populations of Corethrogyne in California. 


MATERIALS AND METHODS 


Sample collection and DNA extraction 


Three individuals of C. filaginifolia var. incana 
were sampled from each of the following six 
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locations: Cabrillo Drive (CabN, CabS), McClelland 
Road (McC), SPAWAR (Spa), Electron Drive 
(Elec), and National Park Service (NPS) on Point 
Loma peninsula, San Diego County (Fig. Ic). The 
sampled sites for C. filaginifolia var. incana were 
within approximately 0.5-5 km from each other. 

Three individuals belonging to Corethrogyne 
filaginifolia var. linifolia were sampled at each of 
the following two locations: Point Loma (PL) (a 
central site on the peninsula) and Torrey Pines State 
Natural Reserve (TPSR) in Del Mar, San Diego 
County (Fig. 1b and c). TPSR is approximately 25- 
26eem from PL. 

Up to 10 leaves were collected from each plant in 
May 2014 in order to obtain DNA for the barcode 
analyses. Plants were collected at reproductive stage 
when C. filaginifolia var. incana can be easily 
distinguished from C. filaginifolia var. linifolia by 
the presence of stout-stalked glands on its involucral 
bracts (Fig. 2). No plant materials were collected for 
herbarium voucher specimens. Leaf samples were 
stored at 4°C immediately upon collection. 

Leaves from each individual were washed and 
sliced into 1-2 mm sections upon arrival at the 
laboratory in Texas Tech University, Lubbock, TX, 
to prepare them for DNA extraction. DNeasy Plant 
Mini Kit (Qiagen, Hilden, Germany) was used to 
obtain DNA following the manufacturer’s protocol. 
DNA quantity and quality was checked using 
NanoDrop ND1000 (Thermo Scientific, Waltham, 
MA, USA) and confirmed by running 1 pl genomic 
DNA on a 1% agarose gel. 


PCR Amplification and Sequencing 


For C. filaginifolia var. incana and C. filaginifolia 
var. linifolia, we used four cpDNA (psbA-trnH, rbcL, 
rpoC1, ycf5), and two nr DNA (ITS and ITS2) plant 
barcoding loci that have previously been used to 
discriminate taxa within the Asteraceae and in other 
plant families (Chase et al. 2007; Kress et al. 2007; 
CBOL Plant Working Group 2009; Chen et al. 2010; 
Gao et al. 2010). These loci were amplified via 
polymerase chain reaction (PCR) in a Mastercycler 
EPGradient (Eppendorf, Hamburg, Germany) using 
genomic DNA. Primer sequences and PCR thermal 
profiles were obtained from Chen et al. (2010). For 
each reaction, a 25 ul mixture was prepared by using 
the Promega GoTaq Flexi DNA Polymerase reagent 
kit (Promega, Madison, WI, USA). Reagent concen- 
trations were 5 ul of 5X Green GoTaq Flexi Buffer, 
0.4 ul of 40 mM dNTPs (10 mM of each dNTP), 4 pl 
of 25 mM MgCl, 1 ul of each primer (10 uM), 0.25 
ul of 10 ug/l bovine serum albumin, 0.25 pl of SU/pl 
GoTaq DNA polymerase, 4 ul of 50-100 ng/ul DNA, 
9.1 ul of sterile HO. During optimization, PCR was 
first performed by using the thermal profiles recom- 
mended by Chen et al. (2010). The loci that did not 
amplify or amplified inconsistently were exposed to 
touchdown PCR by adding a +5°C range to the 
annealing temperature recommended by Chen et al. 
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(2010) for each primer. Amplicons were purified 
using DNA Clean and Concentrator 5 kit (Zymo 
Research, Irvine, CA, USA) and were sent for bi- 
directional sequencing in an ABI 3730x1 genetic 
analyzer (Applied Biosystems, Foster City, CA, 
USA) at the DNA Analysis Facility on Science Hill, 
Yale University, New Haven, CT. 

We then generated consensus sequences by using 
SeqTrace version 0.9.0 (Stucky 2012). Only reads 
with >100 bp length, and showing >50% overlap 
between forward and reverse primer reads were 
selected for further analyses. Quality trimming and 
base calling of the consensus sequences was subse- 
quently performed using the same software. The ends 
of consensus sequences were trimmed if they had at 
least two nucleotide bases exhibiting < 20 Phred 
quality score (Q) using 20 bp window. 


Reference Sequence Extraction from GenBank 


Previously generated nrDNA sequences of 3’ETS 
(30 sequences), 5’ETS (5 sequences), and ITS loci (31 
sequences) from vouchered plants of C. filaginifolia 
(Benson et al. 1999; Markos and Baldwin 2001, 2002; 
NCBI PopSet 417346869) were included in the 
phylogenetic analyses (Table 1). We also included 
3'ETS and ITS sequences from Lessingia arachnoidea 
Greene and L. glandulifera A. Gray to represent 
genera closely related to Corethrogyne (Markos and 
Baldwin 2001; Table 1). Additionally, previously 
generated cpDNA sequences from Bidens pilosa were 
included to represent an outgroup for psbA-trnH, 
rbcL, rpeCl, and ycf> (Chen et al. 2010). 


Sequence Alignment and Sequence Divergence 


All sequences were aligned, and consistency scores 
for the alignments were estimated in T-Coffee version 
11.00 (Notredame et al. 2000) with M-Coffee mode. 
The alignments of nrDNA and cpDNA loci were 
concatenated separately and partitioned into data 
blocks where each data block represented an 
individual locus. The protein coding loci rbcL, 
rpoCl, and ycf5 were further partitioned based on 
the codon position. The partitioned datasets were 
used to predict the best model of DNA evolution in 
Partition Finder version 1.1.1 using Bayesian Infor- 
mation Criterion (BIC) score (Lanfear et al. 2012). 
Sequence divergences were determined for each 
aligned locus using Kimura 2-parameter (K2P) 
DNA substitution model with gamma distribution 
after pairwise deletion of gaps in MEGA (Molecular 
Evolutionary Genetics Analysis) version 6.0 (Tamura 
et al. 2013). 


Phylogenetic Analyses 


Maximum parsimony (MP) trees were generated 
in PAUP* test version 4.0a146 (Swofford 2003) with 
a heuristic search and tree bisection-reconnection 
(TBR) algorithm of branch swapping. Sequence 
addition was random, and all characters were given 
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an equal weight. The ‘multrees’ option was kept 
ineffective, and 1000 bootstrap replicates were used 
to calculate the clade support values. Subsequently, a 
50% majority rule consensus tree was generated from 
the bootstrap replicates. Consistency index (CI) and 
Retention index (RI) were also calculated in PAUP“. 
Additionally, Maximum Likelihood (ML) analysis 
was conducted with RAxML-HPC2 version 7.4.2 
(Stamatakis 2006) by generating 1000 ML trees using 
the random parsimonious tree as the starting tree and 
using GTRGAMMAL as the DNA substitution 
model. One thousand bootstrap replicates were used 
to estimate clade support on the maximum likelihood 
tree. Finally, we performed Bayesian analysis using 
MrBayes version 3.2.5 (Ronquist and Huelsenbeck 
2003). The best DNA substitution models were 
selected based on Partition Finder results, and prior 
probabilities were set to the default values for 
selected models. Two runs were initiated with 
random trees, and four chains were included in each 
run of which three were heated and one was cold. 
Analysis was carried out for >1 million generations 
with the temperature set to 0.2 for sufficient 
swapping between hot and cold chains. Cold chains 
were sampled after every 10,000 generations and 
cross diagnosis of two runs was carried out after 
every 500 generations. Tracer version 1.6 (Rambaut 
and Drummond 2007) was used to determine the 
number of generations with consistent marginal 
probabilities. To. compute the consensus Bayesian 
tree, results from initial 25% generations were 
discarded owing to their unstable posterior proba- 
bility values. Fig Tree version 1.4.2 (Rambaut 2007) 
was used to visualize the phylogenetic trees rooted 
with their respective outgroups. 

Here we report the results of MP, ML, and 
Bayesian analyses based on combined 1) GenBank 
extracted nrDNA loci sequences of C. filaginifolia, 2) 
cpDNA loci from C. filaginifolia var. incana and C. 
filaginifolia var. linifolia individuals. 


Statistical Analyses 


We used sequences downloaded from GenBank 
representing C. filaginifolia specimens from across 
California to conduct an analysis of molecular 
variance (AMOVA) to determine haplotype diversity 
in GENALEX version 6.5 (Peakall and Smouse 
2006). The sequences were divided into four groups 
based on ecoregions and sub-ecoregions of California 
(McNab et al. 2007). Sequences representing Del 
Norte, Mendocino, and Marin counties were catego- 
rized as a single group from Northern California 
Coast region. Those from Santa Clara, Santa Cruz, 
Merced, San Benito and Monterey counties were 
clustered together representing the Central California 
Coast and Coast Ranges region. Sequences generated 
from individuals collected inside Fresno, Kern, 
Madera, and Mariposa counties were placed under 
Great Valley and Sierra Nevada region, whereas 
sequences from Los Angeles, San Bernardino and 
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San Diego were clustered into Southern California 
Coast, Mountain and Valley region. A Mantel test 
was performed with 999 randomized permutations to 
compare the genetic distances among C. filaginifolia 
individuals with corresponding geographic distances 
in GENALEX. 


RESULTS 


Amplification success varied across loci whereby 
ITS, rpoC1, and ycf5 loci amplified in 95% of the 
samples; ITS2 amplified in 80% of the samples; 
psbA-trnH and rbcL amplified in 60% of the samples 
(Table 2). Amplicon lengths for each locus were 
comparable to sequences generated in previous 
studies using the same primer pairs (Kress et al. 
2805: Chen et al. 2010) (Table 2). 


Sequence Alignment, Partition, and Divergence 


The overall alignment consistency score was 99 for 
each locus. The combined alignment of nr DNA loci 
(3’ETS, 5’ETS, and ITS) based on 31 sequences of C. 
filaginifolia consisted of 1884 characters. The best 
scheme showed only one partition for nr DNA 
dataset and K80HG DNA evolution model. The 
cpDNA alignment based on 20 sequences consisted 
of 2338 characters and the best scheme showed two 
partitions for this alignment. The first partition 
consisted of all cpDNA coding loci (rbcL, rpoC1, 
and ycf5) with F81+G DNA substitution model. The 
second partition consisted of the non-coding cpDNA 
gene (psbA-trnH), which was placed under HYG+G 
DNA evolution model. The nrDNA loci containing 
sequences from across California (3’ETS, 5’ETS and 
ITS) exhibited higher average divergences (K2P = 
0.001--0.01) within C. filaginifolia in comparison to 
nrDNA loci (ITS2) and all cpDNA loci (K2P = 
0.0002--0.001) which contained sequences generated 
from C. filaginifolia individuals sampled within San 
Diego County (Table 2). 


Phylogenetic Analyses of nrDNA Loci 


Maximum parsimony analysis showed 9 of 1884 
characters to be parsimony informative for the 
ingroup samples. The average base frequencies were 
Av 012205526. =—0 26007" G = 0.26581; TEN 
With the outgroup taxa added, the program retained 
939 most parsimonious trees, and 941 tree islands 
were obtained from 1000 replications. The tree length 
of most parsimonious trees was 60. The rescaled 
consistency index and retention index were 0.9593 
and 0.9756, respectively, which provide support for 
divergent evolution of sequences. The consistency 
index excluding uninformative characters was 0.9583 
(Fig. 3a). Maximum likelihood analysis of the 
partitioned dataset yielded 1000 trees with the 
maximum likelihood score range of —3057.920 to 
—3057.928. The best tree was used to compute 
bipartitions with 1000 bootstrapped replicates, and 
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TABLE 1. CONTINUED 


Sample_ID 
Cf WeleNorte 1 


Cf Mendocino 1 


Cf Marin 1 

Cf Los Angles 1 

Cf San Diego 8 

Cf Mariposa 1 

Cf Santa Clara 2 

Cf Merced 1 

Cf San Benito 1 

Cf Same 
Bernardino 3 

Cf Monterey 2 


Cf San Diego 10 
Cf Mendocino 3 
Cf Santa Cruz Il 
Cf Mendocino 2 
Cf Santa Clara 1 
Cf San Diego 9 
Cf Bern..2 
Cf Madera 1 
Lessingia_ 
arachnoidea 1 
Lessingia_ 
arachnoidea_2 
Lessingia_ 
glandulifera 1 


Taxon 


C. filaginifolia 


C. filaginifolia 


C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 


C. filaginifolia 


C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
C. filaginifolia 
Lessingia 
arachnoidea 
Lessingia 
arachnoidea 
Lessingia 
glandulifera 


Collector 
and 
Collection 
Number 


Parks & Tracy 


11496 
Bacigalupi 
8993 


Robbins 3891 


Markos 193 


Moran 28334 


Rose 60125 
Ewan 8119 
Lyon 1572 
Lyon 1428 
Jones 7239 


Bacigalupi 
2692 

Reveal 2754 
Moore 1018 
Markos 116 
Semple 8536 
Markos 112 
Markos 144 
Markos 159 
Markos 163 
Markos 210 


Markos 221 


Twisselmann 
13730 


Herbarium 
accession 
number 


JEES 15871 
JEPS 35847 


JEPS 22067 
JEPS 126514 
SD 105277 
JEPS 26049 
UC 521768 
UC 971972 
UC 971971 
UC 1544816 


UC 708668 


SD 110423 
JEPS 126518 
JEPS 88053 
UC 1557828 
JEPS s.n. 
JEPS 88048 
JEPS 126511 
JEPS 126510 
JEPS 126509 


MAGS 126517 


JEPS 54444 


GenBank Accession numbers 


ITS 
JX680484. 1 


JX680485. 1 


JX680486. 1 
JX680487.1 
JX680488.1 
JX680489. 1 
JX680493.1 
JX680492.1 
JX680495.1 
JX680496. 1 


JX680497.1 


JX680498.1 
JQO11998.1 
AF291093 
AF251594 
AF251529 
AF251590 
AZ SOO 
Al? ieee 
JX680490.1 


JX680491.1 


JX680494.1 


3 ETS 
JX680511.1 


JX680512.1 


JX680513.1 
JX680514.5 
JX680515.1 
JX680516.1 
JX680520.1 
JX680519.1 
JX680522.1 
JX680523.0 


JX680524.1 
JX680525.1 


AF251651 
AF251652 
AF251647 
AF251648 
AF251649 
AF251650 
JX680517.1 


JX680518.1 


JX680521.1 


STETS 


AF322271 
AF322272 
AF322267 
AF322268 


AP32226) 


OCT 


po 


ONOYXUOVIN 


79 [OA] 


2017] : KAUR AND SHARMA: PHYLOGEOGRAPHY OF CORETHROGYNE FILAGINIFOLIA 131 


TABLE 2. Amplification success of nuclear and chloroplast DNA barcoding loci used for resolving the phylogeny of 
Corethrogyne filaginifolia. Amplicon length represents the range of sequence length across all individuals of C. filaginifolia. 
Nuclear ribosomal DNA (nr DNA) loci 3’ ETS, and 5’ ETS represent GenBank extracted sequences of C. filaginifolia, while 
the ITS locus represents both GenBank (31), and self-generated (19) sequences. The ITS2 locus of nrDNA and all four 
chloroplast DNA (cpDNA) loci represent self-generated sequences. Pairwise sequence distances were calculated using 
Kimura 2-parameter (K2P) DNA substitution model with gamma distribution after pairwise deletion of gaps. 


PER Sequencing Amplicon Mean Range of 
Locus success (%) success (%) length (bp) n K2P distance K2P distances 
nrDNA 
3’ EMS . n/a n/a n/a 30 0.0031 0.000--0.012 
5’ BARS n/a n/a n/a 5 0.0154 0.004—0.033 
ITS 95 100 659-735 50 0.001 0.000--0.005 
ITS2 | 80 100 436-463 16 0.0003 0.000--0.002 
cpDNA 
ubel 60 100 | 659-712 12 0.0002 0.000--0.001 
rpoCl 95 100 506-510 19 0.0009 0.000--0.004 
ob] 95 100 383-402 19 0.0006 0.000--0.005 
psbA-trnH 60 100 390-415 11 0.001 0.000--0.007 
a Cf Del Norte | 


_707630.99 Hi 


mrenani {°F NAGNAGTINO 2 


aNG f Mendog ino 3 
arames aaa (fF aa | ]H2 
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FIG. 3. a) Phylogeny of Corethrogyne filaginifolia. The tree was constructed using partitioned dataset that included 
previously generated GenBank reference sequences from ITS, 3’ETS, and 5’ETS loci from nuclear ribosomal DNA 
(nr DNA). The clade support values from maximum parsimony, maximum likelihood, and Bayesian analyses are shown for 
each clade (an * indicates <50% bootstrap support). The potential haplotype clades are indicated with text. b) California 
map showing the counties and locations of identified haplotypes, each sampling location is not visibly identifiable by text on 
the map because of the overlap. 
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TABLE 3. A summary of transitions in ITS and transversions 
in 3’ ETS locus alignments among Corethrogyne filaginifolia 
individuals. The length of the aligned ITS locus was 630 
characters and the length of 3’ ETS locus was 565 characters. 
Transitions and transversions are in bold-faced font 
according to the characterization of different haplotypes. 
Haplotype5 is used as a reference character state due to its 
occurrence in majority of the sequences. The character 
(asterisk*) in haplotype2 differentiates it from haplotypel. 


Base position in alignment 


ITS 3 MEDS 
Individuals 388 619 240 273 325 396 
Haplotypel T A C C G G 
Haplotype2 T Gr C € G G 
Haplotype3 C G G A G TI 
Haplotype4 C G G A C G 
Haplotype5 C G G A G G 


the resultant consensus ML tree was obtained with 
<50-100% clade support values (Fig. 3a). Addition- 
ally, the Bayesian analysis showed convergence of 
two independent runs after 10 million generations 
with <0.01 average standard deviation of split 
frequencies, >200 Expected Sample Size (ESS) and 
>1 Potential Scale Reduction Factors (PSRF). 
Marginal likelihood scores of the two runs were very 
similar (-3081 and -3105) suggesting the search 
reached convergence. Of the 64 partitions, 30 
partitions were informative with 50.1 probability 
of occurrence across the two runs (Fig. 3a). All three 
phylogenetic analysis methods were consistent with 
respect to the relative placement of C. filaginifolia 
individuals (Fig. 3a). The C. filaginifolia clade 
revealed five haplotypes based on ITS and 3'ETS 
loci (Fig. 3a, Table 3). These haplotypes resulted 
from two synapomorphies within ITS and four 
synapomorphies within the 3’ETS. Haplotype5 was 
chosen as» the Character reference state for the 
comparisons of synapomorphies (Table 3). The 
length of the aligned ITS locus was 630 characters 
and showed two transitions at the 388th and 619th 
base position. Haplotypel and haplotype2 were 
distinguished based on these two transitions within 
the ITS locus. The length of the aligned 3’ETS locus 
was 564 characters, and there were four transversions 
at 240th, 273th, 325th, and 396th base positions in 
the alignment. The two transversions at 240th and 
273th base positions in 3’ETS locus differentiated 
haplotypel and haplotype2 from the other three 
haplotypes. The single transversions at 325th and 
396th base position in 3’ETS locus distinguished 
haplotype3 and haplotype4, respectively. Addition- 
ally, the ITS tree including GenBank and sequences 
generated from this study grouped all C. filaginifolia 
individuals together to the exclusion of the outgroup 
samples. Nested within a largely unresolved clade of 
C. filaginifolia samples is a clade including individ- 
uals from haplotypel and haplotype2 which is 
supported by moderate to high credibility values in 
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all three phylogenetic analyses (MP = 100%, ML = 
63%, BF = #88) (Fig. 4). 

Analysis of Molecular Variance (AMOVA) 
showed significant differences in nrDNA haplotype 
diversity of C. filaginifolia from four ecoregions (p = 
0.001). The variation in haplotype diversity within 
ecoregions was 33%, whereas variation among 
ecoregions was 67%. The Mantel correlation be- 
tween genetic and geographic distances was 0.21 (p= 
0.01). 


Phylogenetic Analyses of cpDNA Loci 


Maximum Parsimony analysis in PAUP* showed 
seven out of 2338 DNA characters to be parsimony 
informative for cpDNA loci of the ingroup samples. 
The average base frequencies were A = 0.29771; C = 
0.18650; G = 0.19061; T = 0.32518. After addition of 
outgroup taxon, numerous equally parsimonious 
trees were retained (911), and 915 tree islands were 
found from 1000 replications. The tree length of the 
most parsimonious trees was 307. The rescaled 
consistency index, and retention index were 0.9611 
and 0.9643, respectively. The consistency index 
excluding uninformative characters was 0.9583 (Fig. 
5a). One thousand ML trees generated in RAXML 
had maximum likelihood scores ranging from 
—4151.323 to —4152.060. The best tree, with a 
maximum likelihood score of —4151.323, was used 
to assign clade support values from 1000 bootstraps. 
Most of the clade support values were less than 50% 
for the ML analysis (Fig. 5c). In the Bayesian 
analysis, two independent runs had good conver- 
gence with the average standard deviation of split 
frequencies <0.01 after 2 million generations (ESS > 
200, PSRF > 1). The marginal likelihood scores of 
the two runs were also homogenous (—4255 and 
—4256). A total 56 partitions were found from which 
28 partitions were found to be informative with >0.1 
probabilities of occurrence across the two runs (Fig. 
5b). None of the analyses showed genetic discrimi- 
nation between C. filaginifolia var. incana and C. 
filaginifolia var. linifolia. Unlike for the nrDNA data 
phylogenetic analyses, the topology was different for 
the each of the three methods. Also, the majority of 
the clade support values were <50% for ML and MP 
methods. ML, MP, and Bayesian analyses produced 
non-monophyletic varieties. In the cpDNA phyloge- 
netic analyses, individual relationships do not reflect 
geography, nor do the relationships reflect any 
morphological similarities. 


DISCUSSION AND CONCLUSIONS 


We report five haplotypes of C. filaginifolia based 
on nrDNA barcoding data generated from specimens 
collected across California. Of the five haplotypes 
detected, haplotypel and haplotype? together segre- 
gated as a separate lineage from the rest (Fig. 3a). 
Among the five individuals constituting haplotype] 
and haplotype2, three were from Mendocino County 
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Cf Del Norte 1 
Cf Mendocino 1 
Cf Mendocino 2 
Cf Mendocino 3 
Cf Marin ] 

Cf Fresno 1 

Cf Kern 1 
Cikem 2 

Cf Los Angles 1 
Cf Madera 1 

Cf Mariposa 1 
Cf Merced 1 

Cf Monterey 1 
Cf Monterey 2 
Cf San Benito 1 
Cf San Bernardino 1] 
Cf San Bernardino 2 
Cf San Bernardino 3 


Haplotypel 


] Haplotype2 


Cf San Diego 10 
Cf San Diego 2 
Cf San Diego 3 
Cf San Diego 4 
Cf San Diego 3 
Cf San Diego 6 
Cf San Diego 7 
Cf San Diego 8 
Cf San Diego 9 
Cf Santa Clara 1 
Cf Santa Clara 2 
Cf Santa Cruz 1 
Cf incana CabN1 
Cf incana CabS1 
Cf incana CabS2 
Cf incana CabS3 
Cf incana Elecl 
Cf incana Flec2 
Cf incana_Elec3 


Cf incana McCl 


Cf incana McC2 


Cf incana McC3 
Cf incana NPSI 
Cf incana NPS2 
Cf incana NPS3 


Cf incana Spal 


Cf incana Spa3 
Cf linifolia PL 
Cf_linifolia_TPSR1 

Cf linifolia TPSR2 

Cf linifolia TPSR3 
Lessingia arachnoidea | 
Lessingia arachnoidea 2 
Lessingia glandulifera 1 


FIG. 4. Phylogeny of Corethrogyne filaginifolia based on ITS locus from nuclear ribosomal DNA (nrDNA). The tree was 
constructed using previously generated ITS GenBank reference sequences, and ITS sequence generated in this study. The 
clade support values from maximum parsimony, maximum likelihood, and Bayesian analyses are shown. The tree showed 
clustering of C. filaginifolia var. incana, C. filaginifolia var. linifolia, and other GenBank C. filaginifolia individuals except the 
clade of haplotype] and haplotype2 individuals. Lessingia arachnoidea and L. glandulifera were used as outgroup species. 


(Bacigalupi 8993 [JEPS], Moore 1018 [JEPS], and 
Semple 8536 [UC]) one from coastal Del Norte 
County (Parks & Tracy 11496 [JEPS]), and one from 
Marin County (Robbins 3891 [JEPS]) (Fig. 3a and b). 
The populations of C. filaginifolia in Mendocino, Del 
Norte and Marin counties experience similar envi- 
ronmental conditions given their placement within 
the Northern Coast section of California’s coastal 
steppe, mixed forest and redwood forest province 
(McNab et al. 2007). lt is possible that these 
populations constitute a different phylogeographic 
taxon of C. filaginifolia. Moreover, the support 
values for this clade in the nrDNA cladogram were 
high (MP = 82%, ML = 80%, Bayesian = 1.00) (Fig. 
3a). Within this clade, an individual collected from 
Marin County (haplotype2) segregated from individ- 


uals representing Del Norte and Mendocino counties 
(haplotypel) due to lack of shared transition at 619th . 
base position in ITS locus (Table 3). However, we are 
reporting the five individuals belonging to haplo- 
typel and haplotype2 as a single phylogeographic 
taxon given their consistent transition and transver- 
sions at other positions in ITS and 3’ETS loci (Table 
3). Further, the morphology of a specimen collected 
from Marin County (Robbins 3891 [JEPS]) was 
similar to the vouchers representing Del Norte 
(Parks & Tracy 11496 |JEPS]), and Mendocino 
counties (Bacigalupi 8993 [JEPS], Moore 1018 
[JEPS], and Semple 8536 [UC]). Additional multiple 
samples of C. filaginifolia should be included from 
the southern limits of North Coast of California to 
document the genetic and morphological variation 
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FIG. 5. Phylogeny of Corethrogyne filaginifolia based on cpDNA, which included sequences generated in the present study 
from psbA-trnH, rbcL, rpoC1, and ycf5 loci with partitioned dataset. The name of each individual is followed by the 


bp] 


collection site code shown in Fig. 1 and Table 1. The trees provide topology and clade support values from, a) maximum 
parsimony (an “ indicates <50% bootstrap support), b) Bayesian analyses, and c) maximum likelihood analyses methods. 
Bidens pilosa was added as an outgroup species. The trees showed polytomy of C. filaginifolia var. incana and C. filaginifolia 


var. linifolia. 


within this region. The voucher specimens of 
haplotypel (Parks & Tracy 11496 [JEPS], Bacigalupi 
8993 [JEPS], Moore 1018 [JEPS], and Semple 5536 
[UC]), and haplotype2 (Robbins 3891 [JEPS]) showed 
clear morphological distinctions in foliage when 
compared to other haplotypes (Fig. 6). The leaves 
of haplotypel and haplotype2 are spatulate and 
apically serrate (Fig. 6a), whereas the leaves of 
haplotypes 3, 4, and 5 are obovate, oblanceolate to 
linear, and without serrations (Fig. 6b-d). Haplo- 
type3 was identified from specimens from Santa Cruz 


(Markos 116 [JEPS]), Santa Clara (Ewan 8119 [UC]), 
San Benito (Lyon 1428 [UC]), Monterey (Bacigalupi 
2692 [UC]), and Merced (Lyon 1572 [UC]) counties, 
which fall under California central coast and coastal 
ranges ecoregions. The specimens from this region 
shared only a single transversion, and the clade 
support values for haplotype3 were low in both MP 
and ML analyses (51% and <50%, respectively) 
(Fig. 3a). Thus, we cannot be confident in haplotype3 
being a separate phylogeographic taxon. Although, 
the morphology of specimens in haplotype3 was 
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FIG. 6. Morphological variation among herbarium voucher specimens of Corethrogyne filaginifolia corresponding to the 
sequences from GenBank that we used as reference sequences to identify haplotypes. a) JEPS 15871, individual of C. 
filaginifolia from Del Norte County representing haplotype] and haplotype2. The plant shows primary growth of shoot and 
spatulate leaves that are apically serrated. b) UC 971972, individual of C. filaginifolia from Merced County, a representative 
of haplotype3. The plant shows secondary growth with outward growth of shoots, and obovate leaves with entire margins. 
c) JEPS 090954, C. filaginifolia individual from Fresno County, a representative of haplotype5. The plants have oblanceolate 
leaves. d) SD 110423, C. filaginifolia var. linifolia individual from San Diego County, a representative of haplotypeS with 
narrow leaves and tomentose involucral bracts. 
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distinct in terms of plant growth, haplotype3 
individuals exhibited more secondary growth along 
shoots in comparison to haplotypel, which showed 
mostly primary growth (Fig. 6a and b). 

Haplotype4 was represented by two specimens, 
one from San Diego County (Rich 109 [SD]), and the 
other from San Bernardino County (Markos 202 
[JEPS]). This haplotype was characterized by a single 
transversion in 3’ETS locus, and the clade support 
values were low in the parsimony and likelihood 
analyses (MP = 61%, ML = 61%), but high in the 
Bayesian analysis (BI = 0.93) (Fig. 3a). The 
representative specimen of haplotype4 from San 
Diego County (Rich 109 [SD]) was identified as C. 
filaginifolia var. linifolia; whereas, the varietal anno- 
tation of the representative individual from San 
Bernardino County could not be ascertained (Mar- 
kos 202 [JEPS]). Corethrogyne filaginifolia var. 
linifolia has not been reported outside San Diego 
County, thus the grouping of C. filaginifolia var. 
linifolia individual from San Diego County with C. 
filaginifolia individual from San Bernardino County 
does not allow for confident inferences about 
haplotype4. Moreover, individual Cf_San_Diego_10 
(Reveal 2754 [SD]) from haplotype5 was also 
identified as C. filaginifolia var. linifolia. Haplotype5 
was formed by individuals from Central Coast (CR 
338 [JEPS]), Great Valley and Sierra Nevada (Taylor 
717 [JEPS], Markos 158 [JEPS], Rose 60125 [JEPS], 
Lyon 1572 [UC], Markos 159 [JEPS], and Markos 
163 [JEPS]), Southern Coast, and Mountain Valley 
(Reiser s.n. [SD], Rebman 9514 [SD], Rebman 8252 
[SD], Rebman 9579 [SD], Gregory 1179 [SD], Wallace 
s.n. [JEPS], Markos 203 [JEPS], Markos 193 [JEPS], 
Moran 28334 [SD], Jones 7239 [UC], and Reveal 2754 
[SD]) sections of California’s ecoregions. The poly- 
tomy including haplotype5 could not be resolved 
despite its distinctive morphological variation, espe- 
cially in San Diego County (Fig. 3a). 

We did not detect genetic differences among 
individuals of C. filaginifolia var. incana and C. 
filaginifolia var. linifolia which were examined with 
both cpDNA and nrDNA data. The distribution of 
individuals of the two varieties within nr DNA (Fig. 
4) and cpDNA (Fig. 5) cannot be explained with 
respect to either their provenance or morphological 
characteristics. This result either implies the occur- 
rence of varying rates of mutation in the two varieties 
at these loci or the unsuitability of cpDNA and 
nr DNA loci to accurately explain the infraspecific 
variation for these two varieties. The discordance 
between gene trees and morphological distinctions of 
the two varieties can also be a result of incomplete 
lineage sorting. 

The genetic distinction of northern C. filaginifolia 
individuals (haplotypel and haplotype2) from the 
rest of the individuals at nrDNA loci supports the 
morphometric study of C. filaginifolia by Saroyan et 
al. (2000), and we provide evidence in our study to 
classify these individuals as a distinct variety. The 
Mantel correlation (77 = 0.21, p = 0.01) between 


[Vol. 64 


genetic and geographic distances also provided 
evidence that morphological variation might be 
linked to genetic variation along the latitudinal 
gradient in C. filaginifolia. Although morphometric 
differences also exist within and between other 
haplotypes of C. filaginifolia, the genetic analyses 
with universal plant barcoding loci could only 
provide strong evidence for haplotypel and haplo- 
typ2. We suspect that it might not be possible to fully 
resolve the infraspecific variation in C. filaginifolia 
using the standard plant barcoding loci. Although 
higher resolution polymorphic markers such as 
double digest restriction site associated DNA se- 
quencing (ddRAD-seq) of genomic simple sequence 
repeats (SSRs), single nucleotide polymorphism 
(SNPs), and transposable elements are some poten- 
tial approaches to differentiate the infraspecific 
variability within C. filaginifolia. 
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